Visual Classification of Co - Verbal Gestures for Gesture Understanding
نویسندگان
چکیده
A person's communicative intent can be better understood by either a human or a machine if the person's gestures are understood. This thesis project demonstrates an expansion of both the range of co-verbal gestures a machine can identify, and the range of communicative intents the machine can infer. We develop an automatic system that uses realtime video as sensory input and then segments, classifies, and responds to co-verbal gestures made by users in realtime as they converse with a synthetic character known as REA, which is being developed in parallel by Justine Cassell and her students at the MIT Media Lab. A set of 670 natural gestures, videotaped and visually tracked in the course of conversational interviews and then hand segmented and annotated according to a widely used gesture classification scheme, is used in an offline training process that trains Hidden Markov Model classifiers. A number of feature sets are extracted and tested in the offline training process, and the best performer is employed in an online HMM segmenter and classifier that requires no encumbering attachments to the user. Modifications made to the REA system enable REA to respond to the user's beat and deictic gestures as well as turntaking requests the user may convey in gesture. The recognition results obtained are far above chance, but too low for use in a production recognition system. The results provide a measure of validity for the gesture categories chosen, and they provide positive evidence for an appealing but difficult to prove proposition: to the extent that a machine can recognize and use these categories of gestures to infer information not present in the words spoken, there is exploitable complementary information in the gesture stream. Thesis Supervisor: Aaron F. Bobick Title: Associate Professor of Computational Vision Georgia Institute of Technology College of Computing Thesis Supervisor: Justine Cassell Title: AT&T Career Development Associate Professor of Media Arts and Sciences Doctoral Dissertation Committee Thesis Supervisor.......... ... ............................... Aaron F. Bobick Associate Professor of Computational Vision Georgia Institute of Technology College of Computing Thesis Supervisor ................... Justine Cassell Associate Professor of Media Arts and Sciences Program in Media Arts and Sciences Thesis Reader ......................... .... ............ ... Thomas S. Huang Professor of Electrical and Computer Engineering, Beckman Institute, University of Illinois at Urbana-Champaign
منابع مشابه
Hand Gestures Classification with Multi-Core DTW
Classifications of several gesture types are very helpful in several applications. This paper tries to address fast classifications of hand gestures using DTW over multi-core simple processors. We presented a methodology to distribute templates over multi-cores and then allow parallel execution of the classification. The results were presented to voting algorithm in which the majority vote was ...
متن کاملModulating the assessment of semantic speech–gesture relatedness via transcranial direct current stimulation of the left frontal cortex
BACKGROUND Co-verbal gestures are crucial for communication. Neuroimaging studies suggest that the left frontal lobe may be especially important for processing metaphoric co-verbal gestures. However, so far, the specific functional relevance of the left frontal lobe in metaphoric (abstract sentence content) co-verbal gesture processing compared to iconic (concrete sentence content) co-verbal ge...
متن کاملThe influence of body posture and gesture on the evaluation of verbal utterances addressment and comprehensibility
During everyday communication co-speech gestures represent a ubiquitous tool to underpin the verbal content of a message. In addition to gestures, other non-verbal information, such as the direction in which a speaker’s body is oriented, is particularly important during face-to-face interaction. However, the influence of bodily orientation (frontal vs. lateral) and gestures on the evaluation of...
متن کاملAn investigation of co-speech gesture production during action description in Parkinson's disease.
INTRODUCTION Parkinson's disease (PD) can impact enormously on speech communication. One aspect of non-verbal behaviour closely tied to speech is co-speech gesture production. In healthy people, co-speech gestures can add significant meaning and emphasis to speech. There is, however, little research into how this important channel of communication is affected in PD. METHODS The present study ...
متن کاملConceptual motorics: generation and evaluation of communicative robot gesture
How is communicative gesture behavior in robots perceived by humans? Although gesture is a crucial feature of social interaction, this research question is still largely unexplored in the field of social robotics. The present work thus sets out to investigate how robot gesture can be used to design and realize more natural and human-like communication capabilities for social robots. The adopted...
متن کامل